Search results for "exploratory data analysis"

showing 10 items of 16 documents

Tracking archaeological and historical mines using mineral prospectivity mapping.

2014

13 pages; International audience; The present study proposes a technological transfer from modern mining prospection to the field of archaeology, providing a methodology to facilitate the discovery of ancient mining sites. This method takes advantage of the thousands of geochemical analyses of streambed sediments, performed by national geological surveys to inventory mineral substances. In order to delineate geochemical anomalies, the datasets are treated following two different approaches: Exploratory Data Analysis and a fractal-based method often recognised as more powerful. Mineral prospectivity maps are then obtained by combining the results with a geographical information system. The s…

010506 paleontologyArcheologyProspection[SHS.ARCHEO]Humanities and Social Sciences/Archaeology and Prehistory[SDE.MCG]Environmental Sciences/Global Changes010501 environmental sciences01 natural sciencesExploratory data analysisProspectivity mapping[SDU.STU.GC]Sciences of the Universe [physics]/Earth Sciences/GeochemistryMathematical skillInformation systemProspecting0105 earth and related environmental sciencesGeochemical landscapeMines[ SDU.STU.GC ] Sciences of the Universe [physics]/Earth Sciences/GeochemistryArchaeologyField (geography)Exploratory data analysisFractal model[ SDE.MCG ] Environmental Sciences/Global ChangesProspectionArchaeologyStreambed sediment[ SHS.ARCHEO ] Humanities and Social Sciences/Archaeology and PrehistoryGeographical information systemGeology
researchProduct

Development of Applications for Interactive and Reproducible Research: a Case Study

2016

For a proper understanding of the organization and regulation of gene expression, the computational analysis is an essential component of the scientific workflow, and this is particularly true in the fields of biostatistics and bioinformatics. Interactivity and reproducibility are two highly relevant features to consider when adopting or designing a tool, and often they can not be provided simultaneously.In this work, we address the issue of developing a framework that can provide interactive analysis, in order to allow experimentalists to fully exploit advanced software tools, as well as reproducibility as an internal validation of the analysis steps, by providing the underlying code and d…

BioconductorExploratory data analysisSoftwareInteractivityWorkflowExploitbusiness.industryComputer scienceComponent (UML)Big dataSoftware engineeringbusinessData scienceGenomics and Computational Biology
researchProduct

Infrared biospectroscopy for a fast qualitative evaluation of sample preparation in metabolomics.

2014

Liquid chromatography-mass spectrometry (LC-MS) has been increasingly used in biomedicine to study the dynamic metabolomic responses of biological systems under different physiological or pathological conditions. To obtain an integrated snapshot of the system, metabolomic methods in biomedicine typically analyze biofluids (e.g. plasma) that require clean-up before being injected into LC-MS systems. However, high resolution LC-MS is costly in terms of resources required for sample and data analysis and care must be taken to prevent chemical (e.g. ion suppression) or statistical artifacts. Because of that, the effect of sample preparation on the metabolomic profile during metabolomic method d…

ChromatographyPlasma samplesChemistryPlasma compositionIon suppression in liquid chromatography–mass spectrometryBlood ProteinsMass spectrometryMethod developmentMass SpectrometryAnalytical ChemistryMice Inbred C57BLExploratory data analysisMetabolomicsSpectroscopy Fourier Transform InfraredAnimalsMetabolomicsSample preparationFemaleChromatography LiquidTalanta
researchProduct

pcaExplorer: an R/Bioconductor package for interacting with RNA-seq principal components

2019

AbstractBackgroundPrincipal component analysis (PCA) is frequently useentirely written ind in genomics applications for quality assessment and exploratory analysis in high-dimensional data, such as RNA sequencing (RNA-seq) gene expression assays. Despite the availability of many software packages developed for this purpose, an interactive and comprehensive interface for performing these operations is lacking.ResultsWe developed the pcaExplorer software package to enhance commonly performed analysis steps with an interactive and user-friendly application, which provides state saving as well as the automated creation of reproducible reports. pcaExplorer is implemented in R using the Shiny fra…

Computer scienceInterface (computing)ShinyBioconductorPrincipal component analysis610 MedizinRNA-SeqGenomicslcsh:Computer applications to medicine. Medical informaticsReproducible researchBioconductorTranscriptomeExploratory data analysisUser-friendly610 Medical sciencesGene expressionHumansRNA-SeqGenelcsh:QH301-705.5Data CurationBase Sequencebusiness.industrySequence Analysis RNARRNAReproducibility of Resultslcsh:Biology (General)Principal component analysisRNAlcsh:R858-859.7Software engineeringbusinessSoftware
researchProduct

MATLAB-based educational software for exploratory data analysis (EDA toolkit)

2009

This article presents an educational software developed in order to enable engineering students to gain insight into data sets via the exploratory data analysis (EDA). This software has been developed using the MATLAB GUIDE tool. This article shows the program suitability for learning EDA in different engineering courses related to data analysis such as data mining or data processing courses. © 2009 Wiley Periodicals, Inc. Comput Appl Eng Educ 20: 313–320, 2012

Data processingGeneral Computer ScienceComputer sciencebusiness.industryGeneral Engineeringcomputer.software_genreData scienceEducationExploratory data analysisSoftwareMATLABSoftware engineeringbusinesscomputerEducational softwarecomputer.programming_languageComputer Applications in Engineering Education
researchProduct

Empirical model evaluation and hypothesis testing

2016

Chapter 5 deals with the empirical model evaluation and the testing of hypotheses. It starts out with the evaluation of the measurement and the structural models, using the PLS algorithm. After the evaluation of the complete model, moderating effects are examined by conducting group comparisons (section 5.4.1) and by investigating interaction effects (5.4.2). After that, selected constructs are further examined by exploratory data analysis (section 5.5).

Exploratory data analysisBrand relationshipSection (archaeology)EconometricsGroup comparisonStatistical hypothesis testingMathematicsBrand loyalty
researchProduct

Basic Statistical Techniques

2012

Exploratory data analysisData collectionComputer scienceInterval estimationStatisticsData analysisStatistical inferenceSampling (statistics)Statistical and Managerial Techniques for Six Sigma Methodology
researchProduct

Exploratory data analysis of environmental governance at local level in the south-west region of Poland

2018

Exploratory data analysisGeographyEnvironmental governanceRegional scienceGeneral MedicineBiblioteka Regionalisty
researchProduct

Metabolic flux understanding of Pichia pastoris grown on heterogenous culture media

2014

[EN] Within the emergent field of Systems Biology, mathematical models obtained from physical chemical laws (the so-called first principles-based models) of microbial systems are employed to discern the principles that govern cellular behaviour and achieve a predictive understanding of cellular functions. The reliance on this biochemical knowledge has the drawback that some of the assumptions (specific kinetics of the reaction system, unknown dynamics and values of the model parameters) may not be valid for all the metabolic possible states of the network. In this uncertainty context, the combined use of fundamental knowledge and data measured in the fermentation that describe the behaviour…

Principal Component AnalysisbiologyMathematical modelManufacturing processComputer scienceProcess Chemistry and TechnologySystems biologyMonte Carlo samplingESTADISTICA E INVESTIGACION OPERATIVACellular functionsMetabolic networkMetabolic networkMissing-data methods for Exploratory Data AnalysisContext (language use)biology.organism_classificationINGENIERIA DE SISTEMAS Y AUTOMATICAComputer Science ApplicationsAnalytical ChemistryPichia pastorisEconometricsBiochemical engineeringPossibilistic consistency analysisFlux (metabolism)SpectroscopySoftware
researchProduct

Feature selection on a dataset of protein families: from exploratory data analysis to statistical variable importance

2016

Proteins are characterized by several typologies of features (structural, geometrical, energy). Most of these features are expected to be similar within a protein family. We are interested to detect which features can identify proteins that belong to a family, as well as to define the boundaries among families. Some features are redundant: they could generate noise in identifying which variables are essential as a fingerprint and, consequently, if they are related or not to a function of a protein family. We defined an original approach to analyze protein features for defining their relationships and peculiarities within protein families. A multistep approach has been mainly performed in R …

Quantitative Biology::Biomoleculesbusiness.industrySparse PCAPattern recognitionFeature selectionLinear discriminant analysisCross-validationRandom forestExploratory data analysisStatistical classificationArtificial intelligencebusinessCluster analysisMathematics
researchProduct